InFi: End-to-End Learning to Filter Input for Resource-Efficiency in Mobile-Centric Inference

نویسندگان

چکیده

Mobile-centric AI applications have high requirements for the resource-efficiency of model inference. Input filtering is a promising approach to eliminate redundancy so as reduce cost Previous efforts tailored effective solutions many applications, but left two essential questions unanswered: (1) theoretical filterability an inference workload guide application input techniques, thereby avoiding trial-and-error resource-constrained mobile applications; (2) xmlns:xlink="http://www.w3.org/1999/xlink">robust discriminability feature embedding allow be widely diverse tasks and content. To answer them, we first formulate problem theoretically compare hypothesis complexity models filters understand optimization potential. Then propose end-to-end learnable framework that covers most state-of-the-art methods surpasses them in embedding with robust discriminability. We design implement xmlns:xlink="http://www.w3.org/1999/xlink">InFi supports different modalities mobile-centric deployments. Comprehensive evaluations confirm our theoretical results show outperforms strong baselines applicability, accuracy, efficiency. can achieve 8.5× throughput save 95% bandwidth, while keeping over 90% video analytics on platforms.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

End-to-end esophagojejunostomy versus standard end-to-side esophagojejunostomy: which one is preferable?

 Abstract Background: End-to-side esophagojejunostomy has almost always been associated with some degree of dysphagia. To overcome this complication we decided to perform an end-to-end anastomosis and compare it with end-to-side Roux-en-Y esophagojejunostomy. Methods: In this prospective study, between 1998 and 2005, 71 patients with a diagnosis of gastric adenocarcinoma underwent total gastrec...

متن کامل

Comparison of nerve repair with end to end, end to side with window and end to side without window methods in lower extremity of rat

  Abstract   Background : Although, different studies on end-to-side nerve repair, results are controversial. The importance of this method in case is unavailability of proximal nerve. In this method, donor nerves also remain intact and without injury. In compare to other classic procedures, end-to-side repair is not much time consuming and needs less dissection. Overall, the previous studies i...

متن کامل

Fast End-to-End Trainable Guided Filter

Image processing and pixel-wise dense prediction have been advanced by harnessing the capabilities of deep learning. One central issue of deep learning is the limited capacity to handle joint upsampling. We present a deep learning building block for joint upsampling, namely guided filtering layer. This layer aims at efficiently generating the highresolution output given the corresponding low-re...

متن کامل

FIND: Service-Centric End-to-End Abstractions in Network Architectures

Project Summary Next-generation network architectures will be governed by the need for flexibility. Heterogeneous end-systems, novel communication abstractions, and security and manageability challenges will require networks to provide a broad range of services that go beyond the simple store-and-forward capabilities of today's Internet. The proposed work introduces new abstractions for informa...

متن کامل

JEJUNAL EVERSION MUCOSECTOMY AND INVAGINATION: AN INNOVATIVE TECHNIQUE FOR THE END TO END PANCREATICOJEJUNOSTOMY

 ABSTRACT Background: The pancreatojejunostomy has notoriously been known to carry a high rate of operative complications, morbidity and mortality, mainly due to anastomotic leak and ensuing septic complications. Objective: In order to decrease anastomotic leak and its attendant morbidity and mortality in operations requiring a pancreato-jejunal anastomosis, and also in order to simplify the op...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Mobile Computing

سال: 2023

ISSN: ['2161-9875', '1536-1233', '1558-0660']

DOI: https://doi.org/10.1109/tmc.2023.3275981